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Abstract 

Background: A germline, variant in the BRCAl 3'UTR (rs8176318) was previously shown to predict breast and 
ovarian cancer risk in women from high-risk families, as well as increased risk of triple negative breast cancer. Here, 
we tested the hypothesis that this variant predicts tumor biology, like other 3'UTR mutations in cancer. 

Methods: The impact of the /-3'UTR-variant on BRCAl gene expression, and altered response to external 
stimuli was tested in vitro using a luciferase reporter assay. Gene expression was further tested in vivo by 
immunoflourescence staining on breast tumor tissue, comparing triple negative patient samples with the variant 
(TG or TV) or non-variant (GG) BRCAl 3'UTR. To determine the significance of the variant on clinically relevant 
endpoints, a comprehensive collection of West-Irish breast cancer patients were tested for the variant. Finally, an 
association of the variant with breast screening clinical phenotypes was evaluated using a cohort of women from 
the High Risk Breast Program at the University of Vermont. 

Results: Luciferase reporters with the 6RC/\/-3'UTR-variant (T allele) displayed significantly lower gene expression, as 
well as altered response to external hormonal stimuli, compared to the non-variant 3'UTR (G allele) in breast cancer 
cell lines. This was confirmed clinically by the finding of reduced BRCAl gene expression in triple negative samples 
from patients carrying the homozygous TT variant, compared to non-variant patients. The 6/?C/4/-3'UTR-variant 
(TG or TO also associated with a modest increased risk for developing breast cancer in the West-Irish cohort 
(0R= 1.4, 95% CI 1.1-1.8, p = 0.033). More importantly, patients with the /-3'UTR-variant had a 4-fold increased 
risk of presenting with Stage IV disease (p = 0.01 8, OR = 3.37, 95% CI 1 .3-1 1 .0). Supporting that this finding is due to 
tumor biology, and not difficulty screening, obese women with the fi/?C/4/-3'UTR-variant had significantly less dense 
breasts (p = 0.0398) in the Vermont cohort. 

Conclusion: A variant in the 3'UTR of BRCAl is functional, leading to decreased BRCAl expression, modest 
increased breast cancer risk, and most importantly, presentation with stage IV breast cancer, likely due to aggressive 
tumor biology. 

Keywords: 6RC/I ?-3'UTR-variant, Mutation, Breast cancer. Stage IV breast cancer. Metastatic breast cancer, Biomarker, 
Diagnostic marker 
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Background 

Breast cancer is the third most common form of cancer, 
with almost 1.5 million women in the world diagnosed 
with the disease in 2010 alone [1,2]. The extensive use of 
mammography has resulted in a large proportion of 
breast cancer cases being detected at an earlier stage, 
resulting in increased survival and outcome [3]. How- 
ever, approximately 3-6% of patients continue to present 
with metastatic disease at diagnosis throughout the US 
and Europe [4,5]. As a significant number of cases present 
with metastatic disease when the primary tumor is not lo- 
cally advanced [6], one can hypothesize that there is het- 
erogeneity in tumor biology between patients, versus a 
failure of screening. Despite the presence of targeted thera- 
peutics for hormone receptor sensitive and HER2 over- 
expressing breast cancers, treatment of metastatic disease 
remains incurable. Therefore, identification of women with 
a predisposition to develop tumors that are more likely to 
metastasize is critical to help develop improved prevention 
and screening strategies for those individuals. 

The Breast Cancer 1, early onset gene [BRCAl) located 
on chromosome 17q21.31 [7,8] encodes a tumor suppres- 
sor that plays a critical role in the DNA damage response 
and repair pathways [9,10]. Germline variants in the open- 
reading-frame of BRCAl confer a mean risk of 54% and 
39% for developing hereditary breast and ovarian cancer 
(respectively) by age 70 [11-14]. However, BRCAl open- 
reading-frame variants only account for a small portion of 
hereditary breast cancer cases that occur primarily in 
young, premenopausal patients [15]. Therefore, the search 
for additional germline variants, outside of the BRCAl 
open-reading-frame predicting increased breast cancer 
risk has been undertaken. Such variants in the BRCAl 
3'UTR have recently been identified and were first im- 
plicated in breast and ovarian cancer susceptibility in 
high-risk families [16]. Two variants 5711 + 421 TTTand 
5711 + 1286 T/T (located in the BRCAl 3'UTR) are as- 
sociated with cancer risk in Thai women from breast 
and ovarian cancer families (OR = 3.0). Independent 
evaluation of the 5711 + 421 T/T variant (referred to 
here as rs8176318 or the B7?CAi -3'UTR- variant) revealed 
significant variation in baseline frequency by ethnicity, 
with a documented minor allele frequency in Irish popula- 
tions of approximately 0.28 [17]. Homozygous G > T vari- 
ants were found to be associated with increased risk of 
breast cancer in Afirican American women (OR = 9.48, 
95% CI 1.01-88.80), and were specifically associated with 
the development of triple negative breast cancer (OR = 
12.19, 95% CI 1.29-115.21) [17]. This data suggests that 
the B7?CAJ-3'UTR-variant not only confers an increased 
risk of developing breast cancer, but may also be associ- 
ated with tumor biology, since the propensity to develop 
triple negative breast cancer is higher than that of the 
other subtypes. One could hypothesize from these findings 



that the 57?CAi -3'UTR-variant functions similarly to 
that of canonical BRCAl open-reading-frame variants, 
which are more commonly associated with development 
of triple negative breast cancer as opposed to the other 
subtypes [18-20]. 

Open reading frame variants in BRCAl have not clearly 
been associated with unique tumor biology, but only have 
been predictive of response to therapeutic agents that take 
advantage of their inherent DNA repair defects [21]. In 
contrast, 3'UTR variants in cancer- associated genes have 
been shown to predict both altered response to specific 
therapies, as well as inherent differences in tumor biology. 
This is likely due to the fact that these variants are in re- 
gions of regulatory elements that control the nature and 
timing of gene expression, and their effects are only mani- 
fest under particular extracellular and/or intracellular stim- 
uli (for review see ([22]). One mechanism for regulation of 
these variants is by trans-acting factors such as miRNAs, 
which are rapidly altered by external factors such as geno- 
toxic stress [23] and estrogen receptor signaling [24] . 

Based on evidence of the biological function of other 
3'UTR variants in cancer, and the association of the 
fi7?CAJ-3'UTR-variant with breast cancer risk in two pre- 
vious studies [16,17], we sought to investigate the impact 
of this variant on BRCAl expression and its association 
with tumor biology as seen in clinical presentation in a 
clinically well-annotated breast cancer population. 

Methods 

Luciferase reporter assay 

Luciferase reporters containing either the rs8176318 
G-allele or T-allele were generated by PCR amplification 
of the BRCAl 3'UTR loci from HMEC genomic DNA 
(heterozygous for the £7?CAi-3'UTR-variant) using the 
following DNA oligonucleotides: 

BRCAl forward 5' ATGACTCGAGCTGCAGCCAGC 
CACAGGTACAGAGCCACAG 3' 
BRCAl reverse 5' ATGAGCGGCCGCGTGTTTGCT 
ACCAAGTTTATTTGCAGTG 3' 

PCR amplicons were subcloned into the Xhol and NotI 
sites (underlined) of the psiCHECK2 dual luciferase vector 
(Progema). Constructs were sequence verified to confirm 
that the only difference in the BRCAl 3'UTR was the 
rs8176318 variant. 

MCF-7, MDA-MB-231, MDA-MB-361, MDA-MB-468, 
Hs 578 T and BT-20 cells were purchased from the ATCC 
and grown at 37°C and 5% CO2 according to the manufac- 
turer's protocol. MCF-7 and BT-20 cells were cultured 
using MEM (GIBCO) supplemented with 10% fetal bo- 
vine serum (GIBCO) and 100 ug/ml penicillin, 100 U 
streptomycin. MDA-MB-231, MDA-MB-361 and MDA- 
MB-468 cells were cultured using Leibovitz's L-15 (GIBCO) 
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supplemented with 10% fetal bovine serum and 100 ug/ml 
penicillin, 100 U streptomycin. Hs5788T cells were cultured 
in DMEM (GIBCO) supplemented with 10% fetal bovine 
serum and 100 ug/ml penicillin, 100 U streptomycin. 

Cells in log-growth phase were transfected with either 
the G-allele or T-allele expressing luciferase reporters 
(100 ng) using Lipfectamine 2000 (Invitrogen) according 
to the manufacturer's protocol. Following a 16-hour incu- 
bation the cells were lysed and analyzed for dual luciferase 
activities by quantitative titration using the dual luciferase 
assay kit (Promega). Renilla luciferase was normalized to 
firefly luciferase. Graphed is the mean ± standard devi- 
ation (SD) of 3 independent experiments. Statistical sig- 
nificance was determined by student's t-test (1-tailed, 
paired t-test). A p-value of less than 0.05 was considered 
statistically significant. 

Immunofluorescence staining of BRCA1 in tumor tissue 

BRCAl protein expression was analyzed from tumor tis- 
sue derived from the triple negative breast cancer cohort 
subset with corresponding fii?CAi-3'UTR-variant geno- 
type information, using an immunofluorescent platform, 
AQUA™, on tissue microarrays (TMAs) of tumor cores. 
BRCAl protein was assessed using monoclonal MS 110 
Ab-1 anti-BRCAl (Calbiochem) [25-27] and rabbit poly- 
clonal anticytokeratin (DAKO), at dilutions of 1:100 and 
1:200 respectively in 0.3% BSA/TBS buffer for 1 h at SyC. 
AQUA has been described previously [28,29] . 

Estrogen withdrawal assay 

MCF-7 cells cultured in phenol-red free MEM (GIBCO) 
containing 5% fetal bovine serum and 100 ug/ml penicil- 
lin, 100 U streptomycin, were treated with either 100 nM 
Fuvestrant (Sigma 14409) or (B-Estradiol (Sigma E8875). 
Following a 48-hour incubation, the cells were transfected 
with luciferase reporters (100 ng) harboring either the 
BRCAl G-allele or T-allele 3'UTR using Lipofectamine 
2000. After a 16-hour incubation the cells were lysed and 
analyzed for dual luciferase activities by quantitative titra- 
tion. Renilla luciferase was normalized to firefly luciferase. 
Graphed is the mean ± SD of 3 independent experiments, 
preformed in triplicate. Statistical significance was deter- 
mined by student's t-test (1 -tailed, paired t-test). A p-vahie 
of less than 0.05 was considered statistically significant. 

Total RNA was isolated from cell lysates by Trizol ex- 
traction as previously described [30] . cDNA was generated 
using iScript cDNA Synthesis Kit (Bio-Rad). Target mRNA 
was amplified by qPCR using iTaq SYBR Green Supermix 
with ROX (Bio-Rad) on a 7900HT Fast Real-Time PGR 
System (Applied Biosystems) using the following DNA 
oligonucleotide primers: 

Actin forward 5' AGAAAATCTGGCACCACACC 3' 
Actin reverse 5' AGAGGCGTACAGGGATAGCA 3' 



GREBl forward 5' GTGGTAGCCGAGTGGACAAT 3' 
GREBl reverse 5' TGTGCATTACGGACCAGGTA 3' 
TFFl forward 5' CACCATGGAGAACAAGGTGA 3' 
TFFl reverse 5' CCGAGCTCTGGGACTAATCA 3' 

mRNA levels were calculated by the delta-delta Ct 
method [31]. Samples were run in triplicate and standard 
deviation (SD) is the average of 3 independent experiments. 

Study populations 

All women with a biopsy confirming breast cancer at 
Galway Hospital and its affiliates are approached to en- 
roll in the breast cancer study including DNA collection. 
Informed consent, a detailed family history of breast and/ 
or ovarian cancer and a peripheral venous blood sample 
are obtained from cases and controls. Controls were 
women from the west of Ireland, primarily over 60 years 
of age, without a personal history of cancer of any type 
and without a first-degree family member with breast or 
ovarian cancer. These controls were accrued primarily 
from Active Retirement association meetings and from 
Nursing home residents. All cases and controls were re- 
cruited following appropriate ethical approval from the 
Galway University Ethics Committee. 728 cases and 387 
controls were included from this cohort. 

The Irish patient cohort consisted of 728 women with 
invasive, primary operable breast cancer diagnosed be- 
tween June 1980 and August 2007, with complete receptor 
status (outlined in Additional fOe 1). Receptor status was 
determined using established histopathological methods 
and immunohistochemistry, followed by fluorescence in- 
situ hybridisation (FISH) to confirm HER2/«eM positivity 
in samples that scored a 2+ on Hercept test. The samples 
were then grouped into Luminal A, Luminal B, HER2 and 
triple negative subtypes based on receptor status but in 
the absence of gene expression analysis. Patient demo- 
graphics and tumor characteristics were recorded and out- 
come/survival data was prospectively maintained using 
hospital medical records. Disease free survival (DFS) was 
defined as time in months, from breast cancer diagnosis 
to point of loco/regional recurrence or distant disease pro- 
gression, progression free survival (PES) was defined as 
time in months from the point of diagnosis of Stage IV 
cancer to disease progression and overall survival (OS) 
was defined as the time from breast cancer diagnosis to 
the end of follow-up or death (months). 

The CT Triple Negative Breast Cancer (TNBC) Cohort 
has been previously described [32], but briefly, FFPE tissue 
was obtained from 134 TNBC patients, who underwent 
surgery at the Yale University New Haven Hospital or the 
Hospital of Bridgeport, Connecticut, between 1985 and 
2007. Patient sample collection was performed through a 
Yale HIC approved tissue collection protocol. Tissue of 
120 patients was used for TMA construction and the 
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follow up time for these patients ranged between 3 months 
and 19 years with a mean follow up of 4.4 years. Patient 
age at diagnosis ranged from 30 to 90 years, with a mean 
age at diagnosis of 53 years. Sbcty-two patients were diag- 
nosed as node negative and 40 patients as node positive. 
There were 65 patients who were Caucasian in this cohort 
who were used for this analysis. Treatment was known in 
86% of patients, out of those 63% received chemotherapy. 
Gene expression in TMAs was analyzed by AQUA tech- 
nology [28,29], and results were reviewed and confirmed 
by two independent pathologists. 

The High Risk Breast Program from Vermont is a 
database that is IRB approved and was established at the 
University of Vermont in 2003. Eligible women include 
those with a strong family history of breast cancer (55.2% 
of participants), a prior breast biopsy showing atypical 
ductal hyperplasia or lobular neoplasia (14.7%), a known 
germline abnormality of BRCAl or 2 (7.3%, but excluded 
from this study), or a prior history of receiving chemo- 
radio therapy for Hodgkin's disease (1.3%). At study entry, 
unaffected high-risk women provide anthropometric mea- 
surements, medical/family history, physical activity and 
diet information, mammography reports, health behavior 
information and provide a blood sample for storage that 
may be used for future research. 536 women have been 
enrolled into this database since 2003 with follow-up 
visits, questionnaire completion and blood draws occur- 
ring at 4 and 8 years after study entry. Status of enrolled 
women (i.e., new cancer diagnosis) is updated on an on- 
going annual basis. For this study, 367 women were geno- 
typed for the £7?CAi-variant. 

BI?C/1 7-3'UTR-variant genotyping 

1-3 mL of whole blood was drawn from the Irish cases 
and controls and DNA was isolated. DNA was isolated 
from FFPE tissue for genotyping for the TNBC Cohort. 
DNA was supplied from the Vermont cohort. From blood, 
DNA was isolated using a DNA extraction kit (Centra 
Puregene) or Ambion according to the manufacturers 
protocol. Genotyping was performed using a custom Taq- 
Man genotyping assay (Applied Biosysytems) that was spe- 
cific for rs8176318. Each reaction was performed in a 20 i^l 
volume using 10 |il of 2x TaqMan Genotyping MaterMix, 
1 [il of the 20x variant assay, approximately 40 ng of DNA 
and nuclease firee water in a 96-well plate. The reactions 
were run on the Applied Biosystems 7900HT Fast Real- 
Time PCR System in a two-stage process incorporating 
PCR amplification and allelic discrimination. Genotypes 
were analyzed using the Applied Biosystems SDS 2.3 geno- 
typing software and automatic calls were verified by ob- 
serving the spectral contributions of the dye corresponding 
to the sequence specific probe on the Multicomponent 
Data Plot. Internal quality control was maintained using 
established positive and negative controls to ensure 



genotyping accuracy and 6% percent of DNA samples were 
genotyped in duplicate with 100% consistency of results. 
Two DNA samples of the 728 cases failed to amplify and 
were excluded from further analyses. All Caucasian cases 
from the TNBC cohort amplified and were included in the 
analysis. All BRCA coding sequence non-mutant patients 
from the Vermont cohort were included. 

Statistical analysis 

The genetic distribution of the breast cases and controls 
were tested for Hardy- Weinberg equilibrium and were 
found to be in equilibrium. In order to evaluate the distri- 
bution of patient demographics in cases and controls as 
well as tumor features among the cases, categorical vari- 
ables were analyzed using the test and continuous vari- 
ables were analyzed using t-tests. Binary logistic regression 
was used to evaluate the association of each genotype with 
cancer. Case-control analysis comparing genotypes in dif- 
ferent models was performed using a x test to obtain odds 
ratios (OR), 95% Confidence Interval (CI) and p-values. 
Based on the preceding statistical findings, the dominant 
model was used for all further analyses. 

Prevalence of the variant across cancer subtypes, and 
comparison of the respective subtypes against controls 
were evaluated using x^ analyses. The Luminal A cases 
were stratified according to menopausal status and the 
observed genotype distribution compared with controls 
using X test. Association of the variant with ER/PR sta- 
tus controlling for other patient and tumor variables was 
analyzed using binary logistic regression. 

Binary logistic regression was used to evaluate the inde- 
pendent effect of metastasis and disease stage in predicting 
variant positivity in all cancer cases and Luminal A cases 
specifically. Logistic regression analyses for all cases and 
Luminal A cases with a binary outcome variable coded as 
rs8176318 positive (TT or GT genotypes) or negative (GG 
genotype) included variables such as age at diagnosis, 
menopausal status, tumor grade, ER/PR status and stage. 

Cox Proportional Hazards models were fitted to evalu- 
ate the effect of the variant on disease free survival, pro- 
gression free survival and overall survival in all cancer 
cases and according cancer stage. 

Fisher's Exact Test was used to examine the statistical 
significance of the association between mammographic 
density and the presence or absence of the BRCAl- 
3'UTR-variant in both the entire population, as well as 
in a variety of subsets (BMI categories, pre- or post- 
menopausal women, and age at menarche categories). 

Results 

The BRCAl -3'UTR-variant Is associated with decreased 
gene expression In triple negative breast cancer cell lines 

To evaluate if the fi7?CAi -3'UTR-variant alters BRCAl 
gene expression, we generated and tested luciferase 
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reporters containing either the mutant (T) or wild-type 
(G) BRCAl 3'UTR. Reporters were transfected into vari- 
ous breast cancer cell lines and assayed for differences in 
luciferase gene expression as a surrogate for BRCAl ex- 
pression in the presence or absence of the B7?CAi-3'UTR- 
variant. We found that the reporter with the T-allele had 
decreased luciferase expression by approximately 1.4, 1.5 
and 1.8-fold in BT-20, Hs 578 T and MDA-MB-468 triple 
negative breast cancer cell lines, respectively (Figure 1). 
We found no significant difference in luciferase expression 
between the wild-type (G) and mutant (T) alleles in the 
MDA-MB-361, the MDA-MB-231 or MCF-7 breast can- 
cer cells. 

The BRCAl -3'UTR-variant is associated with decreased 
BRCA1 gene expression in triple negative breast cancer 
patients 

To confirm our in vitro findings, we evaluated BRCAl 
protein expression using our CT cohort of triple negative 
breast cancer patient tumor samples, where BRCAl pro- 
tein staining and the B7?CAJ-3'UTR-variant genotype 
analysis was available. While protein coding sequence 
BRCAl and BRCA2 variant status was unavailable for 
these patients, based on previous work, the BRCAl 
3'UTR variant is rarely found in patients with coding se- 
quence variants [17]. Even without excluding protein 
coding sequence mutants, we found BRCAl expression 
was significantly lower in TNBC tumor cores from pa- 
tients harboring the £7?CAi-3'UTR-variant (TT) alleles 
compared to patients harboring hetero and homozygous 
wild-type (TG and GG) alleles (Figure 2). These findings 
support the hypothesis that the B7?C^J-3'UTR-variant is 



associated with lower BRCAl protein expression in TNBC 
tumors, as was seen in vitro. 

Estrogen withdrawal leads to altered gene expression 
from the BRCAl -3'UTR-variant mutant allele 

Based on our findings suggesting that at baseline the 
Bi?CAi-3'UTR-variant led to differential BRCA expres- 
sion, we next tested the hypothesis that the BRCAl- 
3'UTR-variant T-allele could be differentially regulated 
by external cellular events. We chose to study the im- 
pact of the presence or absence of estrogen, based on its 
association with altered expression in TNBC cell lines 
and tumors. We therefore measured the impact of estro- 
gen withdrawal on our mutant and wild-type luciferase 
reporters. MCF-7 cells cultured in fulvestrant (an anti- 
estrogen) or estrogen for 48-hours were transfected with 
luciferase reporters harboring either the wild-type (G) or 
mutant (T) BRCAl 3'UTR. 

We found that estrogen withdrawal resulted in a sig- 
nificant decrease in the expression of the mutant allele, 
without any impact on the wild-type G-allele, indicating 
that estrogen withdrawal differentially impacts the ex- 
pression of the T-allele, leading to down-regulation of 
luciferase expression in the absence of estrogen. In con- 
trast, we found that the addition of estrogen had no ef- 
fect on either the non-mutant (G-allele) or mutant 
allele (T-allele) (Figure 3A). Estrogen depletion was con- 
firmed by RT-PCR analysis of previously described es- 
trogen responsive genes GREBl [33] and TFFl [34], 
which displayed a 10-fold and 7-fold decrease in mRNA 
expression (respectively) in cells treated with fulvestrant 
(Figure 3B). 




MCF-7 MDA-MB-361 MDA-MB-231 Hs578T BT-20 MDA-MB-468 



Figure 1 The impact of the B/?C/l7-3'UTR-variant on luciferase expression in breast cancer cell lines. Dual luciferase reporters liarboring 
either the non-variant (G-allele, dark grey) or variant (T-allele, light grey) BRCAl 3'UTR were transiently transfected into various breast cancer cell 
lines (as indicated). Following a 16-hour incubation the cells were lysed and luciferase activities were analyzed. Renilla luciferase was normalized 
to firefly luciferase. T-allele expression was calculated relative to that of the G-allele. Plotted is the mean and standard deviation of 4 independent 
experiments. *p < 0.05; error bars represent the mean ± standard deviation. 
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TT GT GG 

Figure 2 The SfiCA 7-3'UTR-variant and BRCA1 protein staining in CT TNBC patient tumor cores. Comparison of tlie degree of BRCAl 
protein staining in a IMA according to respective alleles. A lower level of BRCAl staining was noted in the homozygous mutant specimens (JT). 



The association of the BRCAl -3'UTR-variant with breast 
cancer risk 

To determine if there were clinical and biological im- 
pacts of the 57?CA J -3'UTR-variant, we studied a genetic- 
ally and environmentally homogeneous population, to 
best control for "context" effects on variant function. We 
used our case-control analysis of 726 cases and 387 con- 
trols from west-Ireland. Clinico-pathological variables of 
breast cancer cases evaluated in this study and their asso- 
ciation with the variant are in Additional file 1. Overall, 
there was a significant difference in the distribution of the 
three genotypes across cases and controls (p = 0.033), with 
a higher proportion of cases displaying the mutant TT and 
GT genotypes (60[8%] and 318[44%] of 726 cases respect- 
ively) compared to controls (29 [7%] and 141 [36%] of 387 
controls respectively). The dominant model was predictive 
of breast cancer risk compared to controls for all breast 
cancer patients (OR 1.4, 95% CI 1.1-1.8). 

We next evaluated the association of the BRCAl- 
3'UTR-variant across the various breast cancer subtypes. 
Our cohort was comprised of 519 women with Luminal 
A breast cancer, 84 with Luminal B disease, 40 with HER2 
positive disease and 83 with triple negative breast cancer. 
378 (52%) of the 726 breast cancer cases had the variant, 
with similar prevalence between the subtypes (p = 0.392): 
Luminal A (279 [54%] of 519 cases). Luminal B (37 [44%] 
of 84 cases), HER2 (21 [53%] of 40 cases) and triple nega- 
tive breast cancer (41 [49%] of 83 cases). Comparing the 



prevalence of the 5i?CAi-3'UTR-variant within respective 
subtypes with controls. Luminal A breast cancer was most 
strongly associated with the variant by the dominant 
model (OR =1.5, 95% CI 1.1-1.9). This association was 
not seen with the other subtypes (Additional file 2), but 
this was likely due to sample size. 

Previous work indicated that the homozygous (TT) 
mutant genotype was associated with triple negative 
breast cancer in African American patients [17]. There- 
fore, we evaluated the association of patient/ tumor fea- 
tures (age, menopausal status, stage, ER/PR status, and 
tumor grade) with the homozygous TT variant compared 
to hetero TG or homozygous GG alleles in all Irish cases. 
In agreement with this prior study, Irish Caucasian pa- 
tients with ER/PR negative disease were 2.2 times more 
likely to carry the homozygous (TT) rs8176318 variant, 
which was of borderline significance (95% CI 0.98-4.87, 
p = 0.056). 

The association of the BRCAl -3'UTR-variant with tumor 
biology and clinical presentation 

We next tested the hypothesis that the B7?CA J -3'UTR- 
variant may predict altered breast cancer biology in our 
Irish cohort of patients. We found that both disease 
stage (p = 0.015) and presence of distant metastasis at 
presentation (p = 0.037) were significant predictors of the 
B7?CAJ-3'UTR-variant. Regression analyses of all breast 
cancer cases evaluating the contributory role of age. 
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Figure 3 Expression of the S/?C/17-3'UTR-variant mutant allele with estrogen withdrawal. A. MCF-7 cells treated with either 100 nM 
fulvestrant or estrogen for 48-hours, were transfected with dual luciferase reporter plasmids harboring either the non-variant (G-allele, dark grey) 
or variant (T-allele, light grey) BRCAl 3'UTR. After a 16-hour incubation dual luciferase activities were nneasured. Renilla luciferase was normalized 
to firefly luciferase. T-allele expression was calculated to that of the G-allele. Plotted is the mean and standard deviation of at 3 independent 
experiments. *p < 0.05; error bars represent the mean ± standard deviation. B. Total RNA was isolated from cell lysates (A) by Trizol extraction. 
RT-qPCR was utilized to access the effects of fulvestrant and estrogen treatment on mRNA expression of estrogen responsive markers (GREB1 and 
TFF1). The results were normalized to |3-Actin mRNA expression. *p < 0.05; error bars represent the mean ± standard deviation. 



menopausal status, tumor grade, stage and ER/PR status 
in predicting the 5i?C4i-3'UTR-variant was significant 
only for stage (Table 1). Moreover, patients with meta- 
static disease (n = 23) at presentation had a four-fold risk 
of carrying the £7?CAi-3'UTR-variant compared to Stage 
I breast cancer patients (p = 0.018, OR 3.73, 95% CI 1.26- 
11.07). Put differently, 17 (73%) of the 23 patients with 
metastatic disease at presentation were positive for the 
57?C^J-3'UTR-variant, compared to 349 (51%) of 680 
patients without metastatic lesions (p = 0.040, OR 2.7, 
95% CI 1.1-6.9) (Table 2). Controlling for other disease 



variables in a multi-variant model, patients with Stage 
IV disease were three-fold more likely to have the BRCAl- 
3'UTR-variant compared to all other stages of breast can- 
cer (p = 0.055, OR 2.76, 95% CI 1.0-7.8). 

We further performed regression analysis of Luminal 
A cases alone, evaluating the effect of patient age, meno- 
pausal status, disease stage, tumor stage and grade on 
the B7?CAi -3'UTR- variant status. Again we found that the 
B7?CAJ-3'UTR-variant was significant for disease stage. 
Patients presenting with Stage IV disease with Luminal A 
breast cancer had a 10-fold increased risk of carrying the 
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Table 1 Multivariate analysis evaluating the role of patient 
and pathological factors on the 6/?C/l7-3'UTR-variant 


Table 3 Genotype distribution across mammographic 
density Vermont cohort 


positivlty 










Mammographic 


rs8176318 


rs8176318 


p-value* 


Multivariate analysis 




p-value 


OR 


95% CI 


density <50% 


positive 


negative 




Age 




0.915 


1 


0.98-1.02 


Full cohort 


93 


70 


0.2948 


Menopausal status: 


Post 


0.492 


1.29 


0.62-2.69 


Pre-menopausa 


49 


39 


0.7842 


Grade 


2 


0.493 


0.83 


0.49-141 


Post-menopausa 


35 


28 


0.2530 




3 


0.145 


0.66 


0.38-1.16 


Normal BMI 


26 


22 


0.7342 


Stage 


2 


0.065 


1.48 


0.98-2.24 


Overweight BMI 


29 


20 


0.3115 




3 


0.169 


141 


0.87-2.29 


Obese BMI 


36 


26 


0.0398 




4 


0.018 


3.73 


1.26-11.07 


Menarche age 7-1 1 


16 


17 


0.7848 


ER and/or PR status: 


Positive 


0.603 


0.88 


0.56-141 


Menarche age 12-13 


48 


33 


0.4672 












Menarche age >14 


18 


9 


02123 



variant compared to patients with Stage I disease (p = 
0.033, OR 10.05, 95% CI 1.21-83.52). Presence of distant 
metastasis at presentation was independently associated 
with variant positivity, as 11 (92%) of 12 Luminal A pa- 
tients with metastasis had the variant compared to 261 
(53%) of 495 patients without metastasis (p = 0.029, OR 
9.9, 95% CI 1.3-77.0) (Table 2). Controlling for other con- 
founding pathological factors in a multivariant model, 
Stage IV disease in Luminal A cases was again associated 
with the B7?CAJ-3'UTR-variant compared to all other 
stages (p = 0.053, OR 7.78, 95% CI 1.0-62.3). In contrast, 
we found no difference between disease free survival, pro- 
gression free or overall survival using either a recessive or 
dominant model (data not shown). 

The BRCAI-3'UTR-variant is not associated with features 
predicting difficulty In detection 

As mammographic screening is initiated at the age of 50 
in Ireland, and all women standardly participate, we 
tested the hypothesis that this mutation might predict 
difficulty successfully detecting breast cancer using stand- 
ard screening in these patients, explaining the association 
with Stage IV presentation. We therefore studied a cohort 
of women at high risk for breast cancer, with detailed in- 
formation collected prospectively on health, screening and 
outcomes from Vermont. Out of this cohort of 369 tested 
women, 199 had the BRCAl 3'UTR variant. As dense 
breast tissue predicts increased difficulty in tumor detec- 
tion [35-37], we examined features of mammographic 



*P-values represent significance of the association between mammographic 
density and the presence or absence of rs8176318. 



density in this population. We found that women with the 
BRCAl 3'UTR variant were actually significantly less likely 
to have dense breast tissue compared to non-BRCAl 
3'UTR mutant patients, when they had an obese BMI 
(p = 0.0398) (Table 3). To better understand this, we an- 
alyzed the relationship between each density category 
and the BRCAl 3'UTR variant. We found that women 
with the BRCAl 3'UTR variant were less likely to have 
mammograms with scattered fibroglandular density (p = 
0.1397) (Table 4 and Additional file 3), which contributes 
to density. These findings suggest that mammographic 
screening for women with this variant should be at least 
as successful in detecting disease as in women without 
this variant, and thus failed screening is not the explan- 
ation for presentation with Stage IV disease. 

Discussion 

Here we show for the first time that the rs8176318 G > T 
3'UTR variant (the fi7?CAi -3'UTR- variant) is associated 
with decreased BRCAl expression both in vitro and 
in vivo, and is impacted by cellular exposure to estrogen. 
More importantly, we show that this variant predicts ag- 
gressive breast cancer biology and stage IV disease, as 
well as modest increased breast cancer risk in a homo- 
geneous well-characterized west-Irish population. In 
addition, studying a collection of women at high risk for 
breast cancer, we found that this variant is associated 



Table 2 Genotype distribution across metastasis status 



All cases 



TT and GT 
GG 

Luminal A cases 

TT and GT 
GG 



Ml 



MO 



p-value 



17 



11 
1 



349 
331 



261 
234 



0.040 



0.029 



OR (95% CI) 



2.69 (1.05-6.90) 



9.86 (1.26-77.01) 



Table 4 Genotype distribution across fibroglandular 
status 



Mammographic density 
categories 


rs8176318 
positive 


rs8 176318 
negative 


p-value* 


Extremely dense 


17 


18 


0.5936 


Heterogeneously dense 


89 


82 


0.5306 


Scattered fibroglandular 


66 


44 


0.1397 


Fatty replaced 


27 


26 


0.6576 
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with features usually considered to improve the ability 
of mammograms to detect disease (lower mammo- 
graphic density). These findings suggest that presenta- 
tion with stage IV disease of 57?G4i-3'UTR-variant 
patients is unlikely to be due to the inability to detect 
disease early, but instead suggests that this variant pre- 
dicts biologically aggressive disease. These are hypoth- 
esis deserving further investigation. 

While the findings of increased cancer risk are in 
agreement with prior reports [16,17], this is the first 
study evaluating biologic function and clinical associa- 
tions of the B7?CAi-3'UTR-variant with the patients that 
are carriers and develop cancer. While the search for 
germ-line variants in BRCAl have predominantly focused 
on open-reading-frame variants, increasing evidence is 
showing that alterations in non-coding regions of genes 
(such as this variant) explain a proportion of cancer sus- 
ceptibility, and more importantiy play a role in tumor 
biology and can act as prognostic biomarkers. While the 
exact biological mechanism leading to altered BRCAl ex- 
pression in fi7?G4i-3'UTR-variant associated tumors is 
unknown, it is predicted to be a miRNA binding site of 
miR-20a-3p and miR-5001-3p by target prediction pro- 
grams including MirSNP and PolymiRTS, and was shown 
previously to be impacted by miRNA targeting [16]. We 
hypothesize that this may be more complex, with this re- 
gion potentially being a landing dock for other RNA bind- 
ing proteins, and is work that is ongoing but outside of 
the scope of this proposal. 

Diminished expression of BRCAl has previously been 
shown to increase the growth rate of benign and malig- 
nant breast tissue [38,39]. In another study, loss of nuclear 
BRCAl expression (using IHC) was significandy associ- 
ated with high histological grade (p < 0.025) (p < 0.05) 
[40]. Both of these findings could help explain the associ- 
ation of the B7?C4J-3'UTR-variant with tumor progres- 
sion and aggressive phenotype. Interestingly, low BRCAl 
mRNA expression identified in sporadic breast cancer 
specimens has been associated with development of dis- 
tant metastasis (p = 0.019) and a shorter disease free inter- 
val (p = 0.015) [41]. Additionally, Japanese women whose 
tumors stained negative for BRCAl expression had worse 
disease free survival than similar patients whose tumors 
were positive for BRCAl staining [42]. Overall, these find- 
ings are in agreement with our findings regarding the 
57?C4i-3'UTR-variant, that reduced BRCAl expression in 
the absence of germ-line protein coding sequence variants 
may be associated with aggressive tumor biology. 

Although the B7?CA J -3'UTR-variant has now been 
shown to predict a significant increased risk of breast 
cancer risk in three independent well-characterized co- 
horts, it is notable that this variant has not been reported 
from GWAS analyses. We hypothesize that this may be 
partly due to the association of the £7?CAi-3'UTR-variant 



with advanced disease presentation, as patients with Stage 
IV cancer are generally underrepresented in cohorts that 
are not comprehensive sequential patient collections, such 
as the one used in this study, as well as in the Pelletier 
triple negative cohort study [17]. Another possibility is 
that because this variant, similar to other identified 3'UTR 
variants, is altered by "context", in this case estrogen, 
which will be altered by body habitus as well as the soci- 
etal acceptance of hormone replacement therapy, it would 
make it more likely to be missed in mixed populations 
such as those used in GWAS studies. For this new class of 
mutation, 3'UTR variants, the homogeneity and appropri- 
ate characterization of the study sample is likely to be 
much more important than simple sample size. 

Our findings suggest a hypothesis where in women 
with the B7?CAi-3'UTR-variant, if progressing to an es- 
trogen independent phenotype, their BRCAl becomes 
even less functional, possibly allowing more DNA dam- 
age, and perhaps selection for a more aggressive breast 
cancer genotype. These findings could also indicate that 
the £7?G4i-3'UTR-variant becomes the greatest risk for 
cancer development at the time of estrogen withdrawal, 
or menopause. While the steps required to lead to breast 
tumorigenesis in these patients will require studies with 
in vitro and in vivo models, this work represents a sig- 
nificant step forward in generating hypotheses about this 
variant, as well as understanding the role of this variant, 
and other such variants, in cancer biology. 

Conclusion 

Here we show for the first time that the BRCAl-3'UTR- 
variant predicts Stage IV disease, likely due to aggressive 
tumor biology. The discovery of a meaningful clinical as- 
sociation of the B7?CAi-3'UTR-variant in breast cancer 
further highlights the importance of studying such vari- 
ants in appropriate cohorts to better understand their 
clinical potential. 
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Additional file 1: Clinicopathological characteristics of breast 
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